Discovering Word Meanings Based on Frequent Termsets
نویسندگان
چکیده
Word meaning ambiguity has always been an important problem in information retrieval and extraction, as well as, text mining (documents clustering and classification). Knowledge discovery tasks such as automatic ontology building and maintenance would also profit from simple and efficient methods for discovering word meanings. The paper presents a novel text mining approach to discovering word meanings. The offered measures of their context are expressed by means of frequent termsets. The presented methods have been implemented with efficient data mining techniques. The approach is domainand language-independent, although it requires applying part of speech tagger. The paper includes sample results obtained with the presented methods.
منابع مشابه
Clustering Web Documents based on Efficient Multi-Tire Hashing Algorithm for Mining Frequent Termsets
Document Clustering is one of the main themes in text mining. It refers to the process of grouping documents with similar contents or topics into clusters to improve both availability and reliability of text mining applications. Some of the recent algorithms address the problem of high dimensionality of the text by using frequent termsets for clustering. Although the drawbacks of the Apriori al...
متن کاملMeaning of “the Right Imam” based upon the Holy Quran’s Verses
The concept of “the Right Imam” is one of the most significant Quranic concepts and has attracted the attention of various jurisprudential, theological, mystical, interpretative, narrative and historical schools. However, it has not been dealt with by a semantic approach yet. Although the word “Imam” with the meaning of right leader has been used in 5 ranks in the Holy Quran, it could be said t...
متن کاملImam Khomeini`s Quranic Interpretation and Hermeneutics
The words commentary and hermeneutics have been of different meanings for commentators. In the past, they were used interchangeably but in the present, the former means discovering the meaning of words or Divine goal and the latter using the word contrary to its appear meaning or understanding its inner meaning. Unlike these two groups, Imam Khomeini has considered interpretation as discovering...
متن کاملA semantic partition based text mining model for document classification
Feature Extraction is a mechanism used to extract key phrases from any given text documents. This extraction can be weighted, ranked or semantic based. Weighted and Ranking based feature extraction normally assigns scores to extracted words based on various heuristics. Highest scoring words are seen as important. Semantic based extractions normally try to understand word meanings, and words wit...
متن کاملInvestigate the Performance of Document Clustering Approach Based on Association Rules Mining
The challenges of the standard clustering methods and the weaknesses of Apriori algorithm in frequent termset clustering formulate the goal of our research. Based on Association Rules mining, an efficient approach for Web Document Clustering (ARWDC) has been devised. An efficient Multi-Tire Hashing Frequent Termsets algorithm (MTHFT) has been used to improve the efficiency of mining association...
متن کامل